Performance of the modified Bark spectral distortion as an objective speech quality measure
نویسندگان
چکیده
The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1]. The MBSD measure takes into account the noise masking threshold in order to use only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD over the conventional BSD. In this paper, performance of the MBSD is reported in terms of frame sizes, speech classes, and spectral regions. The performance of the MBSD is not very sensitive to the frame size. The performance of the MBSD for voiced speech is almost the same as for non-silent speech. The high frequency region appears to play an important role in human perception of speech quality.
منابع مشابه
Comparison of two objective speech quality measures: MBSD and ITU-T Recommendation P.861
The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1, 2]. The MBSD measure estimates speech distortion in loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD ove...
متن کاملImprovement of MBSD by scaling noise masking threshold and correlation analysis with MOS difference instead of MOS
The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1][2]. The MBSD measure estimates speech distortion in the loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD...
متن کاملIncorporation of temporal masking effects into bark spectral distortion measure
The objective of this paper is to extend a promising objective speech distortion measurement method, the Bark Spectral Distance (BSD) measure, with the auditory concepts of forward and backward temporal masking to improve its measurement accuracy. The results of this investigation show that automatic BSD-based speech quality ratings may be made to correlate better with existing MOS ratings by r...
متن کاملComparative study of several distortion measures for speech recognition
In this study we compared several different spectral distortion measures including the Itakura-Saito (IS), the log likelihood ratio (LLR), the likelihood ratio (LR), the cepstral (CEP), and two perceptually based distortion measures, the weighted likelihood ratio (WLR) and the weighted slope metric (WSM) distortion measures, in terms of their effects on the performance of a standard dynamic tim...
متن کاملPerceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs
Previous objective speech quality assessment models, such as bark spectral distortion (BSD), the perceptual speech quality measure (PSQM), and measuring normalizing blocks (MNB), have been found to be suitable for assessing only a limited range of distortions. A new model has therefore been developed for use across a wider range of network conditions, including analogue connections, codecs, pac...
متن کامل